RAW SEQUENCE LISTING DATE: 11/12/2001

PATENT APPLICATION: US/09/554,414B TIME: 18:23:51

Input Set : A:\corrected Sequence listing.txt 
Output Set: N:\CRF3\11122001\I554414B.raw 

4 <110> APPLICANT: McGILL UNIVERSITY 

5 SZYF, Moshe 

6 BHATTACHARYA, Sanjoy K.

7 RAMCHANDANI, Shyam

10 <120> TITLE OF INVENTION: DNA DEMETHYLASE, THERAPEUTIC AND 

11 DIAGNOSTIC USES THEREOF 
13 <130> FILE REFERENCE: 1770-183 "PCT" FC/ld

C--> 15 <140> CURRENT APPLICATION NUMBER: US/09/554,414B
C--> 15 <141> CURRENT FILING DATE: 2000-09-06 

15 <150> PRIOR APPLICATION NUMBER: CA 2,220,805 

16 <151> PRIOR FILING DATE: 1997-11-12

18 <150> PRIOR APPLICATION NUMBER: CA 2,230,991

19 <151> PRIOR FILING DATE: 1998-05-11 
21 <160> NUMBER OF SEQ ID NOS: 10

23 <170> SOFTWARE: FastSEQ for Windows Version 3.0 

25 <210> SEQ ID NO: 1 

26 <211> LENGTH: 1804 

27 <212> TYPE: DNA 

28 <213> ORGANISM: Unknown 

30 <220> FEATURE: 

31 <223> OTHER INFORMATION: cDNA encoding human demethylase 

33 <400> SEQUENCE: 1 

34 ccgctctgcg ggcggggcgg gtctccggga ttccaagggc tcggttacgg aagaagcgca 60 

35 gagccggctg gggagggggc tggatgcgcg cgcacccggg gggaggccgc tgctgcccgg 120 

36 agcaggagga gggggagagc gcggcgggcg gcagcggcgc tggcggcgac tccgccatag 180 

37 agcagggggg ccagggcagc gcgctcgctc cgtccccggt gagcggcgtg cgcagggaag 240 

38 gcgctcgggg cggcggccgt ggccgggggc ggtggaagca ggcggcccgg ggcggcggcg 300 

39 tctgtggccg tggccgtggc cgtggccggg gtcggggccg tggccggggc cggggccggg 360 

40 gccgcggccg tccccagagt ggcggcagcg gccttggcgg cgacggcggc ggcggcgcgg 420 

41 gcggctgcgg cgtcggcagc ggtggcggcg tcgccccccg gcgggatcct gtccctttcc 480

42 cgtcggggag ctcggggccg gggcccaggg gaccccgggc cacggagagc gggaagagga 540 

43 tggactgccc ggccctcccc cccggatgga agaaggagga agtgatccga aaatcagggc 600 

44 tcagtgctgg caagagcgat gtctactact tcagtccaag tggtaagaag ttcagaagta 660 

45 aacctcagct ggcaagatac ctgggaaatg ctgttgacct tagcagtttt gacttcagga 720 

46 ccggcaagat gatgcctagt aaattacaga agaacaagca gagactccgg aatgaccccc 780 

47 tcaatcagaa caagggtaaa ccagacctga acacaacatt gccaattaga caaactgcat 840 

48 caattttcaa gcaaccagta accaaattca cgaaccaccc gagcaataag gtgaagtcag 900 

49 acccccagcg gatgaatgaa caaccacgtc agcttttctg ggagaagagg ctacaaggac 960 

50 ttagcgcatc agatgtaaca gaacaaatta taaaaaccat ggagctacct aaaggtcttc 1020 

51 aaggagtcgg tccaggtagc aatgacgaga cccttctgtc tgctgtggcc agtgctttac 1080 

52 acacaagctc tgcgcccatc acaggacaag tctctgctgc cgtggaaaag aaccctgctg 1140 

53 tttggcttaa cacatctcaa cccctctgca aagctttcat tgttacagat gaagacatta 1200 

54 ggaaacagga agagcgagtc caacaagtac gcaagaaact ggaggaggca ctgatggccg 1260 

55 acatcctgtc ccgggctgcg gacacggagg aagtagacat tgacatggac agtggagatg 1320 

56 aggcgtaaga atatgatcag gtaactttcg actgaccttc cccaagagca aattgctaga 1380 

57 aacagaatta aaacatttcc actgggtttc gcctgtaaga aaaagtgtac ctgagcacat 1440 

58 agctttttaa tagcactaac caatgccttt ttagatgtat ttttgatgta tatatctatt 1500 

59 attccaaatg atgtttattt tgaatcctag gacttaaaat gagtctttta taatagcaag 1560

60 cagggccctt ccggtgcagt gcagctttga ggccaggtgc agtctactgg aaaggtagca 1620

61 cttacgtgaa atatttgttt cccccacagt tttaatataa acagatcagg agtaccaaat 1680

62 aagtttccca attaaagatt attatacttc actgtatata aacagatttt tatactttat 1740

63 tgaaagaaga tacctgtaca ttcttccatc atcactgtaa agacaaataa atgactatat 1800

64 tcac 1804

66 <210> SEQ ID NO: 2

67 <211> LENGTH: 411

68 <212> TYPE: PRT

69 <213> ORGANISM: Unknown

71 <220> FEATURE:

72 <223> OTHER INFORMATION: predicted amino acid of human demethylase

74 <400> SEQUENCE: 2

75 Met Arg Ala His Pro Gly Gly Gly Arg Cys Cys Pro Glu Gln Glu Glu

76 1 5 10 15

77 Gly Glu Ser Ala Ala Gly Gly Ser Gly Ala Gly Gly Asp Ser Ala Ile

78 20 25 30

79 Glu Gln Gly Gly Gln Gly Ser Ala Leu Ala Pro Ser Pro Val Ser Gly

80 35 40 45

81 Val Arg Arg Glu Gly Ala Arg Gly Gly Gly Arg Gly Arg Gly Arg Trp

82 50 55 60

83 Lys Gln Ala Gly Arg Gly Gly Gly Val Cys Gly Arg Gly Arg Gly Arg

84 65 70 75 80

85 Gly Arg Gly Arg Gly Arg Gly Arg Gly Arg Gly Arg Gly Arg Gly Arg

86 85 90 95

87 Pro Pro Ser Gly Gly Ser Gly Leu Gly Gly Asp Gly Gly Gly Cys Gly

88 100 105 110

89 Gly Gly Gly Ser Gly Gly Gly Gly Ala Pro Arg Arg Glu Pro Val Pro

90 115 120 125

91 Phe Pro Ser Gly Ser Ala Gly Pro Gly Pro Arg Gly Pro Arg Ala Thr

92 130 135 140

93 Glu Ser Gly Lys Arg Met Asp Cys Pro Ala Leu Pro Pro Gly Trp Lys

94 145 150 155 160

95 Lys Glu Glu Val Ile Arg Lys Ser Gly Leu Ser Ala Gly Lys Ser Asp

96 165 170 175

97 Val Tyr Tyr Phe Ser Pro Ser Gly Lys Lys Phe Arg Ser Lys Pro Gln

98 180 185 190

99 Leu Ala Arg Tyr Leu Gly Asn Thr Val Asp Leu Ser Ser Phe Asp Phe

100 195 200 205

101 Arg Thr Gly Lys Met Met Pro Ser Lys Leu Gln Lys Asn Lys Gln Arg

102 210 215 220

103 Leu Arg Asn Asp Pro Leu Asn Gln Asn Lys Gly Lys Pro Asp Leu Asn

104 225 230 235 240

105 Thr Thr Leu Pro Ile Arg Gln Thr Ala Ser Ile Phe Lys Gln Pro Val

106 245 250 255

107 Thr Lys Val Thr Asn His Pro Ser Asn Lys Val Lys Ser Asp Pro Gln

108 260 265 270

109 Arg Met Asn Glu Gln Pro Arg Gln Leu Phe Trp Glu Lys Arg Leu Gln

110 275 280 285

111 Gly Leu Ser Ala Ser Asp Val Thr Glu Gln Ile Ile Lys Thr Met Glu

112 290 295 300

113 Leu Pro Lys Gly Leu Gln Gly Val Gly Pro Gly Ser Asn Asp Glu Thr

114 305 310 315 320

115 Leu Leu Ser Ala Val Ala Ser Ala Leu His Thr Ser Ser Ala Pro Ile

116 325 330 335

117 Thr Gly Gln Val Ser Ala Ala Val Glu Lys Asn Pro Ala Val Trp Leu

118 340 345 350

119 Asn Thr Ser Gln Pro Leu Cys Lys Ala Phe Ile Val Thr Asp Glu Asp

120 355 360 365

121 Ile Arg Lys Gln Glu Glu Arg Val Gln Gln Val Arg Lys Lys Leu Glu

122 370 375 380

123 Glu Ala Leu Met Ala Asp Ile Leu Ser Arg Ala Ala Asp Thr Glu Glu

124 385 390 395 400

125 Met Asp Ile Glu Met Asp Ser Gly Asp Glu Ala

126 405 410 

128 <210> SEQ ID NO: 3 

129 <211> LENGTH: 1589 

130 <212> TYPE: DNA 

131 <213> ORGANISM: Unknown 

133 <220> FEATURE: 

134 <223> OTHER INFORMATION: cDNA sequence of human dMTase2 

136 <400> SEQUENCE: 3 

137 cacgcgcggg cgggtgggcg gagcggcccc cctagcgggg gctgtgaagc gcggggaggg 60

138 ggccgagcgg gtggcgaagc cggcgcgcgc ccggctgggg gcggagggcg gaggcccgtg 120

139 ggacagaaca gctgcggcga gtggcggcgg cggagggagc cgaatcggcg acgagcccgg 180

140 gggtcgcaac ttgcagaagc ggcggcggcg gcggcatcgg ccacggcggg cggaaaagcc 240

141 ggggcgcaat ggagcggaag aggtgggagt gcccggcgct cccgcagggc tgggaaaggg 300

142 aagaagtgcc caggaggtcg gggctgtcgg ccggccacag ggatgtcttt tactatagcc 360

143 ccagcgggaa gaagttccgc agcaagccac aactggcacg ttacctgggc ggatccatgg 420

144 acctcagcac cttcgacttc cgcaccggaa agatgttgat gaacaagatg aataagagtc 480

145 gccagcgtgt gcgctatgat tcttccaacc aggtcaaggg caagcctgac ctgaacaccg 540

146 cgctgcctgt acggcagact gcatccatct tcaagcaacc ggtgaccaag atcaccaacc 600

147 accccagcaa caaggtcaag agcgacccgc agaaggcagt ggaccagccg aggcagcttt 660

148 tctgggagaa gaagctaagt ggattgagtg cctttgacat tgcagaagaa ctggtcagga 720

149 ccatggactt gcccaagggc ctgcagggag tgggccctgg ctgtacagat gagacgctgc 780

150 tgtcagccat tgcgagtgct ctacacacca gcaccctgcc cattacaggc cagctctctg 840

151 cagccgtgga gaagaaccct ggtgtgtggc tgaacactgc acagccactg tgcaaagcct 900

152 tcatggtgac agatgacgac atcaggaagc aggaggagct ggtacagcag gtacggaagc 960

153 gcctggagga ggcactgatg gccgacatgc tagctcatgt ggaggagctt gcccgagacg 1020

154 gggaggcacc actggacaag gcctgtgcag aggaggaaga ggaggaggaa gaggaggagg 1080

155 aagagccgga gccagagcga gtgtagcaca ggtgccctgc ccaagtctgg gctgcagact 1140

156 gccttcagcc ttgcctggac caggtagggg ccagacctgt aggaggcagc cgtccacctc 1200

157 ctttccaaag cctcctgctt ccaggtctca gtgcagggag cccctgtgga ccttgaactc 1260

158 acttgtccct gcgctgcctg gcaggaagcc ccacactgaa agcagatgag cagtgaccca 1320

159 actgagaggc cacctggaca cagtcacctc cctgcctcct tatcatagga caaggccttg 1380

160 cttggcaccg aggagctggg agccgtgttg ggtgctggag gaagtttctg gaaacacacc 1440

161 tggctatgcc caccttatgt ccctaaggct attacaggcc agggtttgga ctgctccggc 1500

162 ccacagggct gcccagcctc cccacactga gggtcagcag cccaccagga agtcactttc 1560

163 cttcaataaa ctgatggtag gaacttgtg 1589 

165 <210> SEQ ID NO: 4 

166 <211> LENGTH: 291 

167 <212> TYPE: PRT 

168 <213> ORGANISM: Unknown 

170 <220> FEATURE: 

171 <223> OTHER INFORMATION: predicted amino acid sequence of human dMTase2 

173 <400> SEQUENCE: 4 

174 Met Glu Arg Lys Arg Trp Glu Cys Pro Ala Leu Pro Gln Gly Trp Glu

175 1 5 10 15

176 Arg Glu Glu Val Pro Arg Arg Ser Gly Leu Ser Ala Gly His Arg Asp

177 20 25 30

178 Val Phe Tyr Tyr Ser Pro Ser Gly Lys Lys Phe Arg Ser Lys Pro Gln

179 35 40 45

180 Leu Ala Arg Tyr Leu Gly Gly Ser Met Asp Leu Ser Thr Phe Asp Phe

181 50 55 60

182 Arg Thr Gly Lys Met Leu Met Ser Lys Met Asn Lys Ser Arg Gln Arg

183 65 70 75 80

184 Val Arg Tyr Asp Ser Ser Asn Gln Val Lys Gly Lys Pro Asp Leu Asn

185 85 90 95

186 Thr Ala Leu Pro Val Arg Gln Thr Ala Ser Ile Phe Lys Gln Pro Val

187 100 105 110

188 Thr Lys Ile Thr Asn His Pro Ser Asn Lys Val Lys Ser Asp Pro Gln

189 115 120 125

190 Lys Ala Val Asp Gln Pro Arg Gln Leu Phe Trp Glu Lys Lys Leu Ser

191 130 135 140

192 Gly Leu Asn Ala Phe Asp Ile Ala Glu Glu Leu Val Lys Thr Met Asp

193 145 150 155 160

194 Leu Pro Lys Gly Leu Gln Gly Val Gly Pro Gly Cys Thr Asp Glu Thr

195 165 170 175

196 Leu Leu Ser Ala Ile Ala Ser Ala Leu His Thr Ser Thr Met Pro Ile

197 180 185 190

198 Thr Gly Gln Leu Ser Ala Ala Val Glu Lys Asn Pro Gly Val Trp Leu

199 195 200 205

200 Asn Thr Thr Gln Pro Leu Cys Lys Ala Phe Met Val Thr Asp Glu Asp

201 210 215 220

202 Ile Arg Lys Gln Glu Glu Leu Val Gln Gln Val Arg Lys Arg Leu Glu

203 225 230 235 240

204 Glu Ala Leu Met Ala Asp Met Leu Ala His Val Glu Glu Leu Ala Arg

205 245 250 255

206 Asp Gly Glu Ala Pro Leu Asp Lys Ala Cys Ala Glu Asp Asp Asp Glu

207 260 265 270

208 Glu Asp Glu Glu Glu Glu Glu Glu Glu Pro Asp Pro Asp Pro Glu Met

209 275 280 285

210 Glu His Val 

211 290 

213 <210> SEQ ID NO: 5 

214 <211> LENGTH: 1966 

215 <212> TYPE: DNA 

216 <213> ORGANISM: Unknown 

218 <220> FEATURE: 

219 <223> OTHER INFORMATION: cDNA sequence of mouse dMTase1

221 <400> SEQUENCE: 5 

222 gggggcgtgg ccccgagaag gcggagacaa gatggccgcc catagcgctt ggaggaccta 60 

223 agaggcggtg gccggggcca cgccccgggc aggagggccg ctctgtgcgc gcccgctcta 120 

224 tgatgcttgc gcgcgtcccc cgcgcgccgc gctgcgggcg gggcgggtct ccgggattcc 180 

225 aagggctcgg ttacggaaga agcgcagcgc cggctgggga gggggctgga tgcgcgcgca 240 

226 cccgggggga ggccgctgct gcccggagca ggaggagggg gagagtgcgg cgggcggcag 300 

227 cggcgctggc ggcgactccg ccatagagca ggggggccag ggcagcgcgc tcgccccgtc 360 

228 cccggtgagc ggcgtgcgca gggaaggcgc tcggggcggc ggccgtggcc gggggcggtg 420 

229 gaagcaggcg ggccggggcg gcggcgtctg tggccgtggc cggggccggg gccgtggccg 480 

230 gggacgggga cggggccggg gccggggccg cggccgtccc ccgagtggcg gcagcggcct 540 

231 tggcggcgac ggcggcggct gcggcggcgg cggcagcggt ggcggcggcg ccccccggcg 600 

232 ggagccggtc cctttcccgt cggggagcgc ggggccgggg cccaggggac cccgggccac 660 

233 ggagagcggg aagaggatgg attgcccggc cctccccccc ggatggaaga aggaggaagt 720 

234 gatccgaaaa tctgggctaa gtgctggcaa gagcgatgtc tactacttca gtccaagtgg 780 

235 taagaagttc agaagcaagc ctcagttggc aaggtacctg ggaaatactg ttgatctcag 840 

236 cagttttgac ttcagaactg gaaagatgat gcctagtaaa ttacagaaga acaaacagag 900 

237 actgcgaaac gatcctctca atcaaaataa gggtaaacca gacttgaata caacattgcc 960 

238 aattagacaa acagcatcaa ttttcaaaca accggtaacc aaagtcacaa atcatcctag 1020 

239 taataaagtg aaatcagacc cacaacgaat gaatgaacag ccacgtcagc ttttctggga 1080 

240 gaagaggcta caaggactta gtgcatcaga tgtaacagaa caaattataa aaaccatgga 1140 

241 actacccaaa ggtcttcaag gagttggtcc aggtagcaat gatgagaccc ttttatctgc 1200 

242 tgttgccagt gctttgcaca caagctctgc gccaatcaca gggcaagtct ccgctgctgt 1260 

243 ggaaaagaac cctgctgttt ggcttaacac atctcaaccc ctctgcaaag cttttattgt 1320 

244 cacagatgaa gacatcagga aacaggaaga gcgagtacag caagtacgca agaaattgga 1380 

245 agaagcactg atggcagaca tcttgtcgcg agctgctgat acagaagaga tggatattga 1440 

246 aatggacagt ggagatgaag cctaagaata tgatcaggta actttcgacc gactttcccc 1500 

247 aagrgaaaat tcctagaaat tgaacaaaaa tgtttccact ggcttttgcc tgtaagaaaa 1560 

248 aaaatgtacc cgagcacata gagcttttta atagcactaa ccaatgcctt tttagatgta 1620 

249 tttttgatgt atatatctat tattcaaaaa atcatgttta ttttgagtcc taggacttaa 1680 

250 aattagtctt ttgtaatatc aagcaggacc ctaagatgaa gctgagcttt tgatgccagg 1740 

251 tgcaatctac tggaaatgta gcacttacgt aaaacatttg tttcccccac agttttaata 1800 

252 agaacagatc aggaattcta aataaatttc ccagttaaag attattgtga cttcactgta 1860 

253 tataaacata tttttatact ttattgaaag gggacacctg tacattcttc catcatcact 1920 

254 gtaaagacaa ataaatgatt atattcacaa aaaaaaaaaa aaaaaa 1966 

256 <210> SEQ ID NO: 6 

257 <211> LENGTH: 414 

258 <212> TYPE: PRT 

259 <213> ORGANISM: Unknown 

261 <220> FEATURE: 

262 <223> OTHER INFORMATION: predicted amino acid sequence of mouse dMTase1

264 <400> SEQUENCE: 6

265 Met Arg Ala His Pro Gly Gly Gly Arg Cys Cys Pro Glu Gln Glu Glu

266 1 5 10 15

267 Gly Glu Ser Ala Ala Gly Gly Ser Gly Ala Gly Gly Asp Ser Ala Ile

268 20 25 30

269 Glu Gln Gly Gly Gln Gly Ser Ala Leu Ala Pro Ser Pro Val Ser Gly
VERIFICATION SUMMARY DATE: 11/12/2001

PATENT APPLICATION: US/09/554,414B TIME: 18:23:52

Input Set : A:\corrected Sequence listing.txt 
Output Set: N:\CRF3\11122001\I554414B.raw 

:270 C: Current Application Number differs, Replaced Current Application No 
:271 C: Current Filing Date differs, Replaced Current Filing Date 